1. INTRODUCTION

A number of important recent advances in econometric theory are related to the truncated regression model, that is, the regression model in which the range of the dependent variable is restricted to some interval of $(-\infty, \infty)$, usually the non-negative half-line, as with the income of an individual. Powell [6], [7] used the $L_1$-norm criterion with some modifications in estimating the regression coefficients in truncated linear models. He proved the consistency and asymptotic normality of his estimates under a set of conditions. On the other hand, Nawata's paper [5] uses the ordinary $L_2$-norm (least squares) criterion, along with a grouping and adjustment of the observed data. In his view, his method has the merit of easy computation compared with the method of Powell.

In this paper we borrow the basic idea of Nawata in grouping and adjusting the observed data. But we shall make simplifications in the procedure of grouping, which enable us to make substantial extensions of the results of [6] under weakened conditions.


2.  ESTIMATION  OF  PARAMETERS  IN  NON-TRUNCATED  CASE 
2.1.  Assumption  of  the  Model 

Let $(X_1,Y_1), \ldots, (X_n,Y_n)$ be iid. samples drawn from an $R^d \times R^1$-valued random variable $(X,Y)$. Denote by $m(x)$ the median of the conditional distribution of $Y$ given $X = x$. We suppose that the conditional distribution function has the form

$$P(Y < y \mid X = x) = F(y - m(x)) \tag{2.1}$$

where $F$ is a fixed distribution function which is not assumed to be known. Under this assumption we can give $Y_i$ a convenient expression as follows:

$$Y_i = m(X_i) + e_i, \quad i = 1, \ldots, n \tag{2.2}$$

where $e_1, \ldots, e_n$ are iid. with common distribution $F$, and $X_1, \ldots, X_n, e_1, \ldots, e_n$ are mutually independent. The probability measure of $X$ will be denoted by $\mu$. In this section we make the following assumptions concerning $F$ and $\mu$. Further assumptions will be introduced when needed.

1°. $F(0) = 1/2$, $f(x) = F'(x)$ exists in some neighborhood of 0, $f(0) > 0$ and $f'(0)$ exists.

2°. $V = \mathrm{COV}(X)$ exists, and $V > 0$.

3°. $\mu$ has no singular component. If $\mu$ has an absolutely continuous component with density $g(x)$, then for sufficiently small $a > 0$, there exists an open set $G_a$ such that the symmetric difference between $G_a$ and $\{x: g(x) > a\}$ has Lebesgue measure zero.

In this section we assume that the median-regression function $m(x)$ has a linear form

$$m(x) = \alpha + \beta' x \tag{2.3}$$

and the problem is to estimate the parameters $\alpha$, $\beta$, using the samples $(X_i, Y_i)$, $i = 1, \ldots, n$.

We shall use $\|a\|$ to denote the Euclidean length of a vector $a$, and $a^{(u)}$ to denote the $u$-th coordinate of $a$. If $A$ is a vector or matrix, we use $|A|$ to denote the maximum of the absolute values of the elements of $A$.


2.2.  The  Main  Result  of  Section  2 

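Choose $\varepsilon_1 \in (0, \frac{1}{2d})$, $\varepsilon_2 \in (\frac12, 1 - d\varepsilon_1)$, $\ell_n = n^{-\varepsilon_1}$, $c_0 > 0$. Decompose $R^d$ into a set $J_n^*$ of supercubes having the form

$$\{(x^{(1)},\ldots,x^{(d)}):\ a_i\ell_n \le x^{(i)} < (a_i+1)\ell_n,\ i = 1,\ldots,d\}, \quad a_i = 0, \pm1, \pm2, \ldots,\ i = 1,\ldots,d. \tag{2.4}$$

For $J \in J_n^*$, use $\#(J)$ to denote the number of elements in the set $J \cap \{X_1,\ldots,X_n\}$. Write

$$\{J:\ J \in J_n^*,\ \#(J) \ge c_0 n^{\varepsilon_2}\} = \{J_{n1}, \ldots, J_{nc_n}\} \tag{2.5}$$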
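
The grouping step in (2.4) and (2.5) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `select_cells` and the example values of `eps1`, `eps2` and `c0` used in any call are hypothetical choices, not values prescribed by the paper.

```python
import math
from collections import defaultdict


def select_cells(xs, eps1, eps2, c0):
    """Group d-dimensional points into cubes of side n**(-eps1), as in (2.4),
    and keep only the cubes holding at least c0 * n**eps2 points, as in (2.5).

    xs is a list of coordinate tuples; the result maps the integer vector
    (a_1, ..., a_d) of a kept cube to the indices of the points inside it.
    """
    n = len(xs)
    side = n ** (-eps1)
    cells = defaultdict(list)
    for idx, x in enumerate(xs):
        # (a_1, ..., a_d) with a_i * side <= x_i < (a_i + 1) * side
        key = tuple(math.floor(coord / side) for coord in x)
        cells[key].append(idx)
    threshold = c0 * n ** eps2
    return {key: members for key, members in cells.items()
            if len(members) >= threshold}
```

Since every kept cube holds at least $c_0 n^{\varepsilon_2}$ of the $n$ points, the number of kept cubes cannot exceed $c_0^{-1} n^{1-\varepsilon_2}$, which is the first inequality in (2.6).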


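We have

$$c_n \le c_0^{-1} n^{1-\varepsilon_2} \le n^{1/2-\varepsilon'} \tag{2.6}$$

for some $\varepsilon' > 0$, when $n$ is large. Further, write

$$J_{ni} \cap \{X_1,\ldots,X_n\} = \{X_{ni}(1), \ldots, X_{ni}(n_i)\}.$$

By definition,

$$n_i \ge c_0 n^{\varepsilon_2}, \quad i = 1, \ldots, c_n. \tag{2.7}$$

We shall write $Y_{ni}(j)$ and $e_{ni}(j)$ for $Y_k$ and $e_k$, when $X_{ni}(j) = X_k$. Put

$$\bar X_{ni} = \sum_{j=1}^{n_i} X_{ni}(j)/n_i,$$

$$Y_{ni} = \mathrm{med}(Y_{ni}(1), \ldots, Y_{ni}(n_i)), \quad e_{ni} = \mathrm{med}(e_{ni}(1), \ldots, e_{ni}(n_i)),$$

$$N_n = n_1 + n_2 + \cdots + n_{c_n},$$

$$\bar X_n = \sum_{i=1}^{c_n} n_i \bar X_{ni}/N_n, \quad \bar Y_n = \sum_{i=1}^{c_n} n_i Y_{ni}/N_n, \quad \bar e_n = \sum_{i=1}^{c_n} n_i e_{ni}/N_n,$$

$$X_{(n)} = (\bar X_{n1} - \bar X_n, \ldots, \bar X_{nc_n} - \bar X_n)', \quad Y_{(n)} = (Y_{n1}, \ldots, Y_{nc_n})', \quad e_{(n)} = (e_{n1}, \ldots, e_{nc_n})',$$

$$W_n = \mathrm{diag}(n_1, \ldots, n_{c_n}), \quad P_n = X_{(n)}' W_n X_{(n)}.$$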

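Define

$$\tilde\beta_n = \beta + P_n^{-1} X_{(n)}' W_n e_{(n)}, \quad \tilde\alpha_n = \alpha + \bar X_n'(\beta - \tilde\beta_n) + \bar e_n \tag{2.8}$$

and $(\hat\alpha_n^{(k)}, \hat\beta_n^{(k)})$, $k = 0, 1, \ldots$, by the following induction process. Set

$$\hat\beta_n^{(0)} = P_n^{-1} X_{(n)}' W_n Y_{(n)}, \quad \hat\alpha_n^{(0)} = \bar Y_n - \bar X_n' \hat\beta_n^{(0)} \tag{2.9}$$

which is the solution of the weighted least squares problem

$$\sum_{i=1}^{c_n} n_i (Y_{ni} - \alpha - \bar X_{ni}'\beta)^2 = \min!.$$

Suppose that $\hat\beta_n^{(k)}$ and $\hat\alpha_n^{(k)}$ have already been defined. Put

$$Y_{ni}^{(k+1)}(j) = Y_{ni}(j) - (X_{ni}(j) - \bar X_{ni})'\hat\beta_n^{(k)}, \quad j = 1, \ldots, n_i, \tag{2.10}$$

$$Y_{ni}^{(k+1)} = \mathrm{med}(Y_{ni}^{(k+1)}(j):\ j = 1, \ldots, n_i), \tag{2.11}$$

$$\bar Y_n^{(k+1)} = \sum_{i=1}^{c_n} n_i Y_{ni}^{(k+1)}/N_n, \quad Y_{(n)}^{(k+1)} = (Y_{n1}^{(k+1)}, \ldots, Y_{nc_n}^{(k+1)})',$$

and then define

$$\hat\beta_n^{(k+1)} = P_n^{-1} X_{(n)}' W_n Y_{(n)}^{(k+1)}, \quad \hat\alpha_n^{(k+1)} = \bar Y_n^{(k+1)} - \bar X_n' \hat\beta_n^{(k+1)} \tag{2.12}$$

which is no other than the solution of the weighted least squares problem

$$\sum_{i=1}^{c_n} n_i (Y_{ni}^{(k+1)} - \alpha - \bar X_{ni}'\beta)^2 = \min!.$$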

The $Y_{ni}^{(k+1)}(j)$'s, defined in (2.10), are an "adjustment" of the original observations $Y_{ni}(j)$ of the dependent variable $Y$. For if we knew $\beta$, we would set $Y_{ni}^*(j) = Y_{ni}(j) - (X_{ni}(j) - \bar X_{ni})'\beta$, and get the exact model $Y_{ni}^* = \alpha + \bar X_{ni}'\beta + e_{ni}$, $i = 1, \ldots, c_n$. This kind of adjustment was introduced by Nawata [5], who used it to make a "first stage" estimate of $\alpha$, $\beta$, which is used to form a "second stage" estimate of $\alpha$, $\beta$ in the case that the dependent variable $Y$ is truncated. We shall use this idea in the next section also. The present work differs from that of Nawata in some important respects. First, the decomposition of the range of the independent variable is greatly simplified, and the conditions imposed on this decomposition are very simple, as compared with the very complicated ones introduced by Nawata. Second, we allow the number of sets in the decomposition to go to infinity, which is conceptually reasonable and enables us to reach the optimal covariance matrix of the limit distribution. Third, we do not assume that the range of the independent variable is bounded. Fourth, the number of iterations in our iterative process has a predetermined bound (see Theorem 1 below), while in [5] this number is indefinite. From a practical point of view, it is not reasonable to define an "estimate" by an infinite number of iterations.
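
The induction process (2.9)-(2.12) can be sketched as follows for the scalar case $d = 1$. This is a minimal illustration under assumptions, not the paper's implementation: the function name `grouped_median_fit`, the default values of `eps1`, `eps2`, `c0`, and the rule used for the default number of rounds (run up to $k = r + 1$ with $r = \lfloor 1/(2\varepsilon_1) \rfloor$) are choices made here for concreteness.

```python
import math
import statistics
from collections import defaultdict


def grouped_median_fit(xs, ys, eps1=0.2, eps2=0.6, c0=0.5, rounds=None):
    """Grouped-median weighted least squares for y = alpha + beta * x + e (d = 1)."""
    n = len(xs)
    side = n ** (-eps1)
    cells = defaultdict(list)
    for x, y in zip(xs, ys):
        cells[math.floor(x / side)].append((x, y))
    # keep cells with at least c0 * n**eps2 points, as in (2.5)
    groups = [g for g in cells.values() if len(g) >= c0 * n ** eps2]
    if rounds is None:
        # r with r * eps1 <= 1/2 < (r + 1) * eps1; adjust r + 1 times
        rounds = math.floor(0.5 / eps1) + 1

    sizes = [len(g) for g in groups]                        # n_i
    xbars = [sum(x for x, _ in g) / len(g) for g in groups]  # cell means of x
    total = sum(sizes)                                       # N_n
    xbar = sum(w * xb for w, xb in zip(sizes, xbars)) / total
    p = sum(w * (xb - xbar) ** 2 for w, xb in zip(sizes, xbars))  # P_n

    def wls(cell_medians):
        # weighted least squares of cell medians on cell means, weights n_i
        ybar = sum(w * m for w, m in zip(sizes, cell_medians)) / total
        beta = sum(w * (xb - xbar) * m
                   for w, xb, m in zip(sizes, xbars, cell_medians)) / p
        return ybar - xbar * beta, beta

    # initial estimate (2.9): raw cell medians
    alpha, beta = wls([statistics.median(y for _, y in g) for g in groups])
    for _ in range(rounds):
        # adjustment (2.10) and cell medians (2.11), then refit (2.12)
        adjusted = [statistics.median(y - (x - xb) * beta for x, y in g)
                    for g, xb in zip(groups, xbars)]
        alpha, beta = wls(adjusted)
    return alpha, beta
```

Each round removes the within-cell slope effect using the current $\hat\beta_n^{(k)}$ before taking cell medians, so the cell medians move closer to the "exact model" values $\alpha + \bar X_{ni}'\beta + e_{ni}$.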

Now  we  state  the  main  theorem  of  this  section: 


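THEOREM 1. Choose an integer $r$ such that

$$r\varepsilon_1 \le 1/2 < (r+1)\varepsilon_1. \tag{2.13}$$

Then under the conditions stated in Section 2.1, we have

$$\sqrt{n}\begin{pmatrix}\hat\alpha_n^{(r+1)} - \alpha \\ \hat\beta_n^{(r+1)} - \beta\end{pmatrix} \xrightarrow{L} N(0, \Lambda^{-1}/(4f^2(0))) \tag{2.14}$$

$$|\hat\alpha_n^{(r+1)} - \tilde\alpha_n| = O_p(n^{-(1/2+\varepsilon_1)}), \quad |\hat\beta_n^{(r+1)} - \tilde\beta_n| = O_p(n^{-(1/2+\varepsilon_1)}) \tag{2.15}$$

where $\Lambda = (\lambda_{ij})$ is a $(d+1) \times (d+1)$ matrix, with

$$\lambda_{00} = 1, \quad \lambda_{0j} = \lambda_{j0} = EX^{(j)}, \quad \lambda_{ij} = E(X^{(i)}X^{(j)}), \quad i, j = 1, \ldots, d.$$

(2.14) means that, as an estimator of $(\alpha, \beta)$, $(\hat\alpha_n^{(r+1)}, \hat\beta_n^{(r+1)})$ possesses an asymptotically optimal covariance matrix.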


2.3. A Lemma

The proof of Theorem 1 depends on a limit theorem concerning the linear forms of $\{e_{n1}, \ldots, e_{nc_n}\}$, which we consider separately in this subsection.


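LEMMA 1. Let $c_1, c_2, \ldots$ be natural numbers such that

$$\lim_{n\to\infty} c_n = \infty. \tag{2.16}$$

For each $n$, give a set of iid. variables $\{e_{ij}^{(n)}:\ j = 1,\ldots,n_i,\ i = 1,\ldots,c_n\}$, where

$$n_1 + n_2 + \cdots + n_{c_n} \le n, \tag{2.17}$$

$$\lim_{n\to\infty} (\sqrt{n}\log n)/\min(n_1, \ldots, n_{c_n}) = 0. \tag{2.18}$$

Assume that the distribution function $F$ of $e_{ij}^{(n)}$ does not depend on $n$, and $F$ satisfies condition 1° of Section 2.1. Let $a_{ni}(j)$: $i = 1,\ldots,c_n$, $j = 1,\ldots,r$ be constants satisfying the following conditions:

$$\sum_{i=1}^{c_n} n_i a_{ni}(j) = 0, \quad j = 1, \ldots, r;\ n = 1, 2, \ldots \tag{2.19}$$

$$\lim_{n\to\infty} \sum_{i=1}^{c_n} n_i a_{ni}(j_1) a_{ni}(j_2)/n = \lambda_{j_1 j_2} \text{ exists and is finite for } j_1, j_2 = 1, \ldots, r. \tag{2.20}$$

Define $e_i^{(n)} = \mathrm{med}(e_{i1}^{(n)}, \ldots, e_{in_i}^{(n)})$, $i = 1,\ldots,c_n$, and

$$\xi_{nj} = \sum_{i=1}^{c_n} n_i a_{ni}(j) e_i^{(n)}/\sqrt{n}, \quad j = 1, \ldots, r; \quad \xi_n = (\xi_{n1}, \ldots, \xi_{nr})'. \tag{2.21}$$

Then we have

$$\xi_n \xrightarrow{L} N_r(0, \Lambda/(4f^2(0))) \tag{2.22}$$

as $n \to \infty$, where $\Lambda$ is the matrix with elements $\lambda_{j_1 j_2}$.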
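
The constant $1/(4f^2(0))$ in (2.22) is the familiar asymptotic variance of a sample median. A small Monte Carlo sketch makes this concrete for a single cell: for standard normal errors, $f(0) = 1/\sqrt{2\pi}$, so $\sqrt{n_i}\,e_i^{(n)}$ should have variance close to $\pi/2$. The function name `scaled_median_variance`, the normal error law and the cell size used below are illustrative assumptions only.

```python
import math
import random
import statistics


def scaled_median_variance(cell_size, replications, rng):
    """Monte Carlo variance of sqrt(n_i) * median of n_i iid N(0, 1) errors."""
    draws = []
    for _ in range(replications):
        med = statistics.median(rng.gauss(0.0, 1.0) for _ in range(cell_size))
        draws.append(math.sqrt(cell_size) * med)
    return statistics.pvariance(draws)


# Target value 1 / (4 f(0)^2) for the standard normal density f.
target = 1.0 / (4.0 * (1.0 / math.sqrt(2.0 * math.pi)) ** 2)  # equals pi / 2
estimate = scaled_median_variance(201, 1500, random.Random(0))
```

The lemma goes further than this single-cell check: it combines many such cell medians with weights $n_i a_{ni}(j)/\sqrt{n}$ and lets the number of cells grow with $n$.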




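Proof. Consider first the case $r = 1$, and write for simplicity $a_{ni} = a_{ni}(1)$, $\sigma^2 = \lambda_{11}$, $\xi_n = \xi_{n1}$.

Given $\delta > 0$. By the assumption made on $F$, we have $F(\delta) > 1/2$. Using an inequality of Hoeffding [4], we get

$$P(e_i^{(n)} \ge \delta) \le P\Big(\Big|\frac{1}{n_i}\sum_{j=1}^{n_i} I(e_{ij}^{(n)} \ge \delta) - (1 - F(\delta))\Big| \ge F(\delta) - \frac12\Big) \le 2\exp(-n_i(F(\delta) - 1/2)^2/3).$$

From this and (2.18), we have

$$P(e_i^{(n)} \ge \delta) \le \exp(-\sqrt{n}), \quad i = 1, \ldots, c_n$$

for $n$ large. Similarly it is shown that

$$P(e_i^{(n)} \le -\delta) \le \exp(-\sqrt{n}), \quad i = 1, \ldots, c_n$$

for $n$ large. Hence for $n_0$ large we have (note that $c_n \le \sqrt{n}$ for $n$ large, by (2.17) and (2.18))

$$\sum_{n=n_0}^{\infty}\sum_{i=1}^{c_n} P(|e_i^{(n)}| \ge \delta) \le \sum_{n=n_0}^{\infty} 2\sqrt{n}\,e^{-\sqrt{n}} < \infty.$$

Therefore, wp1 (with probability one) we have

$$|e_i^{(n)}| \le \delta, \quad i = 1, \ldots, c_n \tag{2.23}$$

for $n$ large.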


Denote by $\{U_{ij}:\ i = 1, 2, \ldots,\ j = 1, 2, \ldots\}$ a family of iid. random variables with common distribution $R(0,1)$, and

$$U_i^{(n)} = \mathrm{med}(U_{i1}, \ldots, U_{in_i}), \quad i = 1, \ldots, c_n.$$

By assumption on $F$, the inverse function $F^{-1}$ exists in some neighborhood of 1/2, so we can find some $\delta > 0$ such that the distribution functions of $F^{-1}(U_i^{(n)})$ and $e_i^{(n)}$ coincide on $(-\delta, \delta)$. From this and (2.23), it is seen that the assertion

$$\xi_n \xrightarrow{L} N(0, \sigma^2/(4f^2(0))) \tag{2.24}$$

is equivalent to

$$\tilde\xi_n \triangleq \sum_{i=1}^{c_n} n_i a_{ni} F^{-1}(U_i^{(n)})/\sqrt{n} \xrightarrow{L} N(0, \sigma^2/(4f^2(0))). \tag{2.25}$$


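According to a theorem of Csörgő and Révész concerning the strong approximation of the quantile process (see [2]) there exist independent $N(0, 1/4)$ random variables $\eta_{n1}, \ldots, \eta_{nc_n}$ such that

$$P\big(|\sqrt{n_i}(U_i^{(n)} - \tfrac12) - \eta_{ni}| > n_i^{-1/2}(A\log n_i + Z)\big) \le Be^{-CZ}, \quad \text{for } |Z| \le D\sqrt{n_i} \tag{2.26}$$

where $A$, $B$, $C$, $D$ are positive absolute constants. Choosing $Z = 5\log n_i/C$ and putting $K_1 = A + 5/C$, we have

$$\sum_{n=n_0}^{\infty}\sum_{i=1}^{c_n} P\big(|\sqrt{n_i}(U_i^{(n)} - \tfrac12) - \eta_{ni}| \ge K_1 n_i^{-1/2}\log n_i\big) \le B\sum_{n=n_0}^{\infty} \sqrt{n}\,n^{-5/2} < \infty.$$

Therefore, wp1 we have

$$|U_i^{(n)} - (\tfrac12 + \eta_{ni}/\sqrt{n_i})| \le K_1 n_i^{-1}\log n_i, \quad i = 1, \ldots, c_n \tag{2.27}$$

for $n$ large. From this it follows that (2.25) is equivalent to

$$\xi_n^* \triangleq \sum_{i=1}^{c_n} n_i a_{ni} F^{-1}(\tfrac12 + \eta_{ni}/\sqrt{n_i} + \theta_{ni})/\sqrt{n} \xrightarrow{L} N(0, \sigma^2/(4f^2(0))) \tag{2.28}$$

where $\theta_{ni}$, $i = 1, \ldots, c_n$ are random variables such that

$$|\theta_{ni}| \le K_1 n_i^{-1}\log n_i, \quad i = 1, \ldots, c_n;\ n = 1, 2, \ldots \tag{2.29}$$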


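Since $2\eta_{ni} \sim N(0,1)$, it is well known that (see [3], page 131)

$$P(|\eta_{ni}| \ge \varepsilon\sqrt{n_i}) \le 2\cdot\frac{1}{\sqrt{2\pi}}\cdot\frac{1}{2\sqrt{n_i}\,\varepsilon}\exp\big(-\tfrac12(2\sqrt{n_i}\,\varepsilon)^2\big) \le e^{-\sqrt{n}}$$

for $i = 1, \ldots, c_n$ and large $n$. Hence we have for large $n_0$

$$\sum_{n=n_0}^{\infty}\sum_{i=1}^{c_n} P(|\eta_{ni}|/\sqrt{n_i} \ge \varepsilon) \le \sum_{n=n_0}^{\infty} \sqrt{n}\,e^{-\sqrt{n}} < \infty$$

which implies that wp1 we have

$$|\eta_{ni}|/\sqrt{n_i} \le \varepsilon, \quad i = 1, \ldots, c_n \tag{2.30}$$

for $n$ large. Considering (2.29), (2.30), and the assumption made on $F$, we get

$$F^{-1}(\tfrac12 + \eta_{ni}/\sqrt{n_i} + \theta_{ni}) = \frac{1}{f(0)}\Big[(\eta_{ni}/\sqrt{n_i} + \theta_{ni}) + \tfrac12(\Gamma + \varepsilon_{ni})(\eta_{ni}/\sqrt{n_i} + \theta_{ni})^2\Big] \tag{2.31}$$

where $\Gamma = -f'(0)/(f(0))^2$, and $\varepsilon_{n1}, \ldots, \varepsilon_{nc_n}$ are random variables such that

$$\lim_{n\to\infty}\max(|\varepsilon_{n1}|, \ldots, |\varepsilon_{nc_n}|) = 0, \quad \text{a.s.} \tag{2.32}$$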


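From (2.31), we can rewrite (2.28) as follows

$$\xi_n^* = T_{n1} + \cdots + T_{n5} \tag{2.33}$$

where

$$T_{n1} = \sum_{i=1}^{c_n} \sqrt{n_i}\,a_{ni}\eta_{ni}/(\sqrt{n}f(0)),$$

$$T_{n2} = \sum_{i=1}^{c_n} n_i a_{ni}\theta_{ni}/(\sqrt{n}f(0)),$$

$$T_{n3} = \sum_{i=1}^{c_n} \tfrac12(\Gamma + \varepsilon_{ni}) a_{ni}\eta_{ni}^2/(\sqrt{n}f(0)),$$

$$T_{n4} = \sum_{i=1}^{c_n} (\Gamma + \varepsilon_{ni})\sqrt{n_i}\,a_{ni}\theta_{ni}\eta_{ni}/(\sqrt{n}f(0)),$$

$$T_{n5} = \sum_{i=1}^{c_n} \tfrac12(\Gamma + \varepsilon_{ni})\theta_{ni}^2 n_i a_{ni}/(\sqrt{n}f(0)).$$

Since $\sum_{i=1}^{c_n} n_i a_{ni}^2/n \to \sigma^2$, we have

$$T_{n1} \xrightarrow{L} N(0, \sigma^2/(4f^2(0))). \tag{2.34}$$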

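From (2.29), one finds

$$|T_{n2}| \le K_1\Big(\sum_{i=1}^{c_n}\frac{n_i|a_{ni}|}{n}\cdot\frac{\sqrt{n}\log n_i}{n_i}\Big)\Big/f(0). \tag{2.35}$$

From (2.17), by the Schwarz inequality,

$$\Big(\sum_{i=1}^{c_n}\frac{n_i}{n}|a_{ni}|\Big)^2 \le \sum_{i=1}^{c_n}\frac{n_i}{n}a_{ni}^2\cdot\sum_{i=1}^{c_n}\frac{n_i}{n} \le \sum_{i=1}^{c_n}\frac{n_i}{n}a_{ni}^2 \to \sigma^2 < \infty.$$

We see that

$$\sup\Big\{\sum_{i=1}^{c_n} n_i|a_{ni}|/n:\ n = 1, 2, \ldots\Big\} \le K_2 < \infty. \tag{2.36}$$

Also, by (2.18), it is seen that

$$\max\{\sqrt{n}\log n_i/n_i:\ i = 1, \ldots, c_n\} \to 0, \quad (n \to \infty). \tag{2.37}$$

From (2.35)-(2.37), one gets

$$\lim_{n\to\infty} T_{n2} = 0. \tag{2.38}$$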

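For $T_{n3}$, we note that $E(\eta_{ni}^2) = 1/4$, so by (2.18) and (2.36),

$$E\Big(\sum_{i=1}^{c_n}|a_{ni}|\eta_{ni}^2/\sqrt{n}\Big) = \frac14\sum_{i=1}^{c_n}|a_{ni}|/\sqrt{n} = \frac14\sum_{i=1}^{c_n}\frac{n_i|a_{ni}|}{n}\cdot\frac{\sqrt{n}}{n_i} \to 0.$$

Considering this and (2.32), we get

$$T_{n3} \xrightarrow{P} 0. \tag{2.39}$$

$T_{n4}$ and $T_{n5}$ can be handled in a similar way, obtaining

$$T_{n4} \xrightarrow{P} 0, \quad T_{n5} \xrightarrow{P} 0, \quad (n \to \infty). \tag{2.40}$$

Now (2.28) follows from (2.33)-(2.35), (2.39), (2.40). This proves the lemma for $r = 1$.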

In order to prove the lemma for general $r$, take an arbitrary constant vector $t = (t_1, \ldots, t_r)'$, then

$$t'\xi_n = \sum_{i=1}^{c_n} n_i a_{ni} e_i^{(n)}/\sqrt{n}$$

where

$$a_{ni} = \sum_{j=1}^{r} t_j a_{ni}(j), \quad i = 1, \ldots, c_n. \tag{2.41}$$

From (2.19) and (2.20), it is readily seen that

$$\sum_{i=1}^{c_n} n_i a_{ni} = 0, \quad n = 1, 2, \ldots; \qquad \sum_{i=1}^{c_n} n_i a_{ni}^2/n \to t'\Lambda t.$$

Hence, according to the proved result for the case of $r = 1$, we have

$$t'\xi_n \xrightarrow{L} N(0, t'\Lambda t/(4f^2(0))).$$

Since this holds true for arbitrarily chosen $t$, (2.22) follows, and the lemma is proved.

Conditions  of  the  lemma  can  be  somewhat  weakened.  Also,  the  lemma 
can  be  proved  by  resorting  to  classical  methods  of  Central  Limit  Theorem, 
but  verification  of  the  conditions  will  be  quite  complicated. 

2.4.  Proof  of  Theorem  1 

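First note the simple fact that if $u_i = u + t_i'g + h_i$, $i = 1, \ldots, k$, then there exists a vector $t$ in the convex hull of $\{t_1, \ldots, t_k\}$, such that $\mathrm{med}(u_1, \ldots, u_k) = u + t'g + \mathrm{med}(h_1, \ldots, h_k)$. Using this fact, one sees that there exists $X_{ni}^* \in J_{ni}$ ($X_{ni}^*$ depends upon $X_i$, $Y_i$, $i = 1, \ldots, n$, and $\alpha$, $\beta$) such that

$$Y_{ni} = \alpha + X_{ni}^{*\prime}\beta + e_{ni} = \alpha + \bar X_{ni}'\beta + e_{ni} + (X_{ni}^* - \bar X_{ni})'\beta. \tag{2.42}$$

Therefore, on putting $X_{(n)}^* = (X_{n1}^* - \bar X_n, \ldots, X_{nc_n}^* - \bar X_n)'$, one verifies that

$$\hat\beta_n^{(0)} - \tilde\beta_n = P_n^{-1}X_{(n)}'W_n(X_{(n)}^* - X_{(n)})\beta. \tag{2.43}$$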
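
The "simple fact" about medians used at the start of this proof can be checked numerically. The sketch below relies on one observation: the scalar $s = \mathrm{med}(u_i) - u - \mathrm{med}(h_i)$ always lies between $\min_i t_i'g$ and $\max_i t_i'g$, so a suitable $t$ can be taken on the segment joining the two extreme $t_i$. The helper names `dot` and `hull_point` are hypothetical and introduced only for this illustration.

```python
import statistics


def dot(a, b):
    """Inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def hull_point(u0, g, ts, hs):
    """Return (t, s) with t in the convex hull of ts and
    med(u_i) = u0 + t'g + med(h_i), where u_i = u0 + t_i'g + h_i and s = t'g."""
    us = [u0 + dot(t, g) + h for t, h in zip(ts, hs)]
    s = statistics.median(us) - u0 - statistics.median(hs)
    proj = [dot(t, g) for t in ts]
    lo, hi = min(proj), max(proj)
    t_lo, t_hi = ts[proj.index(lo)], ts[proj.index(hi)]
    # weight on the segment between the two extreme t_i; clamp rounding noise
    w = 0.0 if hi == lo else (s - lo) / (hi - lo)
    w = min(1.0, max(0.0, w))
    t = [(1.0 - w) * a + w * b for a, b in zip(t_lo, t_hi)]
    return t, s
```

The argument behind the check is monotonicity of the median: since $\min_i t_i'g + h_i \le u_i - u \le \max_i t_i'g + h_i$ for every $i$, the same bounds hold after taking medians on all three sides.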


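We have shown in [1] that under the assumption of the present theorem, one has

$$\lim_{n\to\infty} P_n/n = V, \quad \text{a.s.} \tag{2.44}$$

Also, the absolute value of the $(u, v)$ element of $n^{-1}X_{(n)}'W_n(X_{(n)}^* - X_{(n)})$ does not exceed

$$\sum_{i=1}^{c_n} n_i|\bar X_{ni}^{(u)} - \bar X_n^{(u)}|\,|X_{ni}^{*(v)} - \bar X_{ni}^{(v)}|/n \le n^{-\varepsilon_1}\sum_{i=1}^{c_n} n_i|\bar X_{ni}^{(u)} - \bar X_n^{(u)}|/n. \tag{2.45}$$

Here we used the obvious fact that $|X_{ni}^* - \bar X_{ni}| \le n^{-\varepsilon_1}$. By an argument similar to that used in [1], it can be shown that

$$\lim_{n\to\infty}\sum_{i=1}^{c_n} n_i|\bar X_{ni}^{(u)} - \bar X_n^{(u)}|/n = E|X^{(u)} - EX^{(u)}| < \infty, \quad \text{a.s.} \tag{2.46}$$

From (2.43)-(2.46), it is readily seen that for any given $\delta > 0$, there exists a (finite) constant $m_0'$ such that

$$P(|\hat\beta_n^{(0)} - \tilde\beta_n| \le m_0' n^{-\varepsilon_1}) \ge 1 - \delta \tag{2.47}$$

for $n$ large.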

Now it follows from Lemma 1 that

$$\sqrt{n}(\tilde\beta_n - \beta) \xrightarrow{L} N(0, V^{-1}/(4f^2(0))). \tag{2.48}$$

The argument is as follows. By definition (2.8), and (2.44), one sees that (2.48) is equivalent to

$$n^{-1/2}V^{-1}X_{(n)}'W_n e_{(n)} \xrightarrow{L} N(0, V^{-1}/(4f^2(0))). \tag{2.49}$$

Given $X_1, X_2, \ldots$, consider the conditional distribution of $T_n \triangleq n^{-1/2}V^{-1}X_{(n)}'W_n e_{(n)}$; then this is just the case studied in Lemma 1 with $r = d$, and

$$\begin{pmatrix} a_{n1}(1) & \cdots & a_{nc_n}(1) \\ a_{n1}(2) & \cdots & a_{nc_n}(2) \\ \vdots & & \vdots \\ a_{n1}(d) & \cdots & a_{nc_n}(d) \end{pmatrix} = V^{-1}X_{(n)}'.$$

It can easily be verified that the conditions of Lemma 1 are met, with

$$\Lambda = \lim_{n\to\infty} V^{-1}X_{(n)}'W_nX_{(n)}V^{-1}/n = V^{-1}VV^{-1} = V^{-1}, \quad \text{a.s.}$$

So wp1 (2.49) holds true conditionally given $X_1, X_2, \ldots$, and it still holds true unconditionally. From (2.48) it follows that

$$\sqrt{n}\,|\tilde\beta_n - \beta| = O_p(1). \tag{2.50}$$

Combining (2.47) and (2.50), one sees that there exists $m_0$ such that for $n$ large,

$$P(|\hat\beta_n^{(0)} - \beta| \le m_0 n^{-t_1}) \ge 1 - \delta, \quad t_1 = \min(\tfrac12, \varepsilon_1). \tag{2.51}$$

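By (2.42),

$$\bar Y_n = \alpha + \bar X_n'\beta + \bar e_n + (\bar X_n^* - \bar X_n)'\beta, \quad \bar X_n^* = \sum_{i=1}^{c_n} n_i X_{ni}^*/N_n.$$

Hence by (2.8) and (2.9)

$$\hat\alpha_n^{(0)} - \tilde\alpha_n = \bar X_n'(\tilde\beta_n - \hat\beta_n^{(0)}) + (\bar X_n^* - \bar X_n)'\beta. \tag{2.52}$$

Since $\bar X_n \to EX$ a.s. and $|\bar X_n^* - \bar X_n| \le n^{-\varepsilon_1}$, from (2.47) and (2.52) we get a constant $\ell_0$ such that for large $n$

$$P(|\hat\alpha_n^{(0)} - \tilde\alpha_n| \le \ell_0 n^{-\varepsilon_1}) \ge 1 - \delta. \tag{2.53}$$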


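Put $k = 0$ in (2.10), and notice that $Y_{ni}(j) = \alpha + X_{ni}(j)'\beta + e_{ni}(j)$; we get

$$Y_{ni}^{(1)}(j) = \alpha + \bar X_{ni}'\beta + e_{ni}(j) + (\bar X_{ni} - X_{ni}(j))'(\hat\beta_n^{(0)} - \beta).$$

Again there exists $X_{ni}^{**}$ in the convex hull of $\{\bar X_{ni} - X_{ni}(j):\ j = 1, \ldots, n_i\}$, such that

$$Y_{ni}^{(1)} = \alpha + \bar X_{ni}'\beta + e_{ni} + X_{ni}^{**\prime}(\hat\beta_n^{(0)} - \beta). \tag{2.54}$$

Since $|X_{ni}^{**}| \le n^{-\varepsilon_1}$, from (2.51) and (2.54), it follows by an argument used earlier that there exists $m_1'$ such that for large $n$

$$P(|\hat\beta_n^{(1)} - \tilde\beta_n| \le m_1' n^{-(t_1+\varepsilon_1)}) \ge 1 - \delta. \tag{2.55}$$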


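Combining this and the fact that $|\tilde\beta_n - \beta| = O_p(n^{-1/2})$, we find $m_1$ such that for large $n$

$$P(|\hat\beta_n^{(1)} - \beta| \le m_1 n^{-t_2}) \ge 1 - \delta, \quad t_2 = \min(\tfrac12, t_1 + \varepsilon_1). \tag{2.56}$$

From (2.8), (2.12) (setting $k = 0$) and (2.54), one gets

$$\hat\alpha_n^{(1)} - \tilde\alpha_n = \bar X_n^{**\prime}(\hat\beta_n^{(0)} - \beta) - \bar X_n'(\hat\beta_n^{(1)} - \tilde\beta_n), \quad \bar X_n^{**} = \sum_{i=1}^{c_n} n_i X_{ni}^{**}/N_n. \tag{2.57}$$

From (2.51), (2.55), and the fact that $|\bar X_n^{**}| \le n^{-\varepsilon_1}$, we find $\ell_1$ such that for any large $n$

$$P(|\hat\alpha_n^{(1)} - \tilde\alpha_n| \le \ell_1 n^{-(t_1+\varepsilon_1)}) \ge 1 - \delta. \tag{2.58}$$

In deriving (2.58) one should also note that, as shown above, the event $\{|\hat\beta_n^{(1)} - \tilde\beta_n| \le m_1' n^{-(t_1+\varepsilon_1)}\}$ is a consequence of $\{|\hat\beta_n^{(0)} - \beta| \le m_0 n^{-t_1}\}$.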


Continuing this process, one finds generally that there exist constants $m_k'$, $m_k$ and $\ell_k$, such that for $n$ large we have

$$P(|\hat\beta_n^{(k)} - \tilde\beta_n| \le m_k' n^{-(t_k+\varepsilon_1)}) \ge 1 - \delta \tag{2.59}$$

$$P(|\hat\beta_n^{(k)} - \beta| \le m_k n^{-t_{k+1}}) \ge 1 - \delta \tag{2.60}$$

$$P(|\hat\alpha_n^{(k)} - \tilde\alpha_n| \le \ell_k n^{-(t_k+\varepsilon_1)}) \ge 1 - \delta \tag{2.61}$$

where $t_0 = 0$ and $t_{k+1} = \min(\tfrac12, t_k + \varepsilon_1)$.

Since $r\varepsilon_1 \le 1/2$ and $(r+1)\varepsilon_1 > 1/2$, we have $t_i = i\varepsilon_1$ for $i \le r$, and so $t_r + \varepsilon_1 = (r+1)\varepsilon_1$, $t_{r+1} = 1/2$. Therefore, on putting $k = r + 1$ in (2.59) and (2.61), we get (2.15).

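In view of (2.15), (2.14) is equivalent to

$$\sqrt{n}\begin{pmatrix}\tilde\alpha_n - \alpha \\ \tilde\beta_n - \beta\end{pmatrix} \xrightarrow{L} N_{d+1}(0, \Lambda^{-1}/(4f^2(0))). \tag{2.62}$$

As $\tilde\alpha_n$ and $\tilde\beta_n$ are linear functions of $e_{(n)}$, (2.62) can easily be proved by using Lemma 1; the argument is just the same as we employed in showing (2.49). This concludes the proof of the theorem.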

The assertion (2.14) still holds true when $r + 1$ in the left hand side of (2.14) is replaced by $r$, or by some $k > r + 1$. But iterating beyond $(r + 1)$ rounds is unprofitable, in view of the fact that $t_{r+1} = t_{r+2} = \cdots = 1/2$.
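
The bookkeeping behind this remark is easy to tabulate. The sketch below, with the hypothetical helper name `rate_schedule`, computes the integer $r$ of (2.13) and the exponents $t_k = \min(1/2, k\varepsilon_1)$ for $k = 0, \ldots, r+1$; it assumes $\varepsilon_1$ is given as an ordinary floating point number.

```python
import math


def rate_schedule(eps1):
    """Return (r, [t_0, ..., t_{r+1}]) with r * eps1 <= 1/2 < (r + 1) * eps1
    and t_k = min(1/2, k * eps1)."""
    r = math.floor(0.5 / eps1)
    ts = [min(0.5, k * eps1) for k in range(r + 2)]
    return r, ts
```

For example, $\varepsilon_1 = 0.2$ gives $r = 2$ and exponents $0, 0.2, 0.4, 0.5$, so the exponent reaches its ceiling of 1/2 at $k = r + 1 = 3$ and further rounds leave it unchanged.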


3.  ESTIMATION  OF  PARAMETERS  IN  TRUNCATED  CASE 


In this section we study the case in which the dependent variable is truncated at zero. If the original values of $Y$ are $Y_1, \ldots, Y_n$, then actually we observe

$$Y_i I(Y_i > 0), \quad i = 1, \ldots, n.$$

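Introduce $J_n^*$ as we did in Section 2.2. Choose constants $c' > 0$, $\varepsilon_1' \in (1 - \varepsilon_1, 1)$, where $\varepsilon_1$ has been introduced at the beginning of Section 2.2. Divide $J_n^*$ into three disjoint parts. Let $H_i = \sum_{j=1}^{n_i} I(Y_{ni}(j) > 0)$, and

$$J_{n1}^* = \{J_{ni}:\ H_i > n_i/2 + c'n_i^{\varepsilon_1'},\ i = 1, \ldots, c_n\},$$

$$J_{n2}^* = \{J_{ni}:\ H_i < n_i/2 - c'n_i^{\varepsilon_1'},\ i = 1, \ldots, c_n\},$$

$$J_{n3}^* = J_n^* - J_{n1}^* - J_{n2}^*.$$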
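
The three-way split of the cells depends only on the count $H_i$ of positive observations in each cell. A minimal sketch, with the hypothetical function name `classify_cell` and with `c_prime` and `eps1_prime` standing for $c'$ and $\varepsilon_1'$:

```python
def classify_cell(ys, c_prime, eps1_prime):
    """Return 1, 2 or 3 according to whether a cell with observed values ys
    falls in the first, second or third part of the decomposition."""
    n_i = len(ys)
    h_i = sum(1 for y in ys if y > 0)       # H_i: number of positive values
    margin = c_prime * n_i ** eps1_prime    # c' * n_i ** eps1'
    if h_i > n_i / 2 + margin:
        return 1   # clearly more than half positive
    if h_i < n_i / 2 - margin:
        return 2   # clearly fewer than half positive
    return 3       # too close to one half to decide
```

A cell in the first class has a positive cell median, so truncation at zero does not alter the median; this is the property exploited in the estimation that follows.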

For convenience, we shall in this section write $\tilde x'\gamma$ for $\alpha + x'\beta$, by introducing $\tilde x = (1, x')'$ and $\gamma = (\alpha, \beta')'$. We use $x$ and $\alpha$ to replace $\tilde x$ and $\gamma$. In this way we change $\alpha + x'\beta$ to $x'\alpha$.

The  following  lemma  will  be  used  in  the  sequel. 


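LEMMA 2. Wp1 we have for any given $\varepsilon_2' < \varepsilon_1'$

$$J_{ni} \in J_{n1}^* \Rightarrow \bar X_{ni}'\alpha \ge n_i^{-1+\varepsilon_2'}, \quad i = 1, \ldots, c_n \tag{3.1}$$

$$J_{ni} \in J_{n2}^* \Rightarrow \bar X_{ni}'\alpha \le -n_i^{-1+\varepsilon_2'}, \quad i = 1, \ldots, c_n \tag{3.2}$$

for $n$ sufficiently large.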


-’♦‘2 


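Proof. Assume that $\bar X_{ni}'\alpha < n_i^{-1+\varepsilon_2'}$, then

$$X_{ni}(j)'\alpha < n_i^{-1+\varepsilon_2'} + n^{-\varepsilon_1} \le n_i^{-1+\varepsilon_2'} + n_i^{-\varepsilon_1} \le n_i^{-1+\varepsilon_0}, \quad j = 1, \ldots, n_i$$

for some $\varepsilon_0 < \varepsilon_1'$. Hence, in order to have $J_{ni} \in J_{n1}^*$, the inequality

$$\tilde H_i \triangleq \sum_{j=1}^{n_i} I(e_{ni}(j) > -n_i^{-1+\varepsilon_0}) > \tfrac12 n_i + c'n_i^{\varepsilon_1'}$$

must be true. On the other hand, from the assumptions made on $F$ (see Section 2.1), one can find a constant $c'' > 0$ such that

$$p \triangleq P(e_{ni}(j) > -n_i^{-1+\varepsilon_0}) \le 1/2 + c''n_i^{-1+\varepsilon_0}.$$

Using Hoeffding's inequality [4], and observing that $\varepsilon_0 < \varepsilon_1'$, $\varepsilon_1' > 1 - \varepsilon_1 > 1/2$, $n_i \ge c_0 n^{\varepsilon_2}$ (see (2.7)), we get for $n$ large

$$P^*(J_{ni} \in J_{n1}^*) \le P^*(|\tilde H_i/n_i - p| \ge c'n_i^{-1+\varepsilon_1'} - c''n_i^{-1+\varepsilon_0}) \le P^*(|\tilde H_i/n_i - p| \ge \tfrac12 c'n_i^{-1+\varepsilon_1'}) \le 2\exp(-n_i(\tfrac12 c'n_i^{-1+\varepsilon_1'})^2/3) \le n^{-3} \tag{3.3}$$

simultaneously for $i = 1, \ldots, c_n$, where $P^* = P^*(X_1, X_2, \ldots)$ is the conditional distribution given $X_1, X_2, \ldots$. Since (3.3) holds for each $(X_1, X_2, \ldots)$, we get for $n$ large

$$P(J_{ni} \in J_{n1}^*) \le n^{-3} \tag{3.4}$$

simultaneously for $i = 1, \ldots, c_n$. Introduce the event

$$E_n = \{\text{for some } i = 1, \ldots, c_n,\ \bar X_{ni}'\alpha < n_i^{-1+\varepsilon_2'} \text{ but } J_{ni} \in J_{n1}^*\}.$$

Then since $c_n \le n$, we have $P(E_n) \le c_n/n^3 \le n^{-2}$, yielding

$$P(E_n, \text{ i.o.}) = 0$$

which means that wp1 $\bar X_{ni}'\alpha < n_i^{-1+\varepsilon_2'} \Rightarrow J_{ni} \notin J_{n1}^*$ for all $i = 1, \ldots, c_n$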

and $n$ sufficiently large. This is just (3.1). (3.2) can be proved in a similar fashion.

3.1 Estimation Using Only $J_{n1}^*$-cells

If a cell $J_{ni}$ belongs to $J_{n1}^*$, then, although the observations of the dependent variable related to this cell might have been truncated, the median of the original observations can still be calculated. Therefore the method of the previous section can be applied to the collection of these cells, yielding an estimate for $\alpha$.

In order to avoid the introduction of numerous new notations, from now on in this section we shall redefine $J_{n1}, \ldots, J_{nc_n}$ as the elements in $J_{n1}^*$. Other notations in Section 2, too, are redefined in accordance with this change. For instance, the symbol $N_n$ should be understood as

$$N_n = \sum_{i:\ J_{ni} \in J_{n1}^*} n_i.$$

Ending this process we get a redefined estimate of $\alpha$ (the original $(\alpha, \beta')'$), which we now denote by $\hat\alpha_n^{(r+1)}$.

For this estimate the following theorem is true:

THEOREM 2. Suppose in addition to the conditions of Theorem 1 that

$$P(X'\alpha > 0) > 0, \tag{3.5}$$

$$V = \mathrm{COV}(X \mid X'\alpha > 0) > 0. \tag{3.6}$$

Then, as $n \to \infty$, we have

$$\sqrt{n}(\hat\alpha_n^{(r+1)} - \alpha) \xrightarrow{L} N(0, V^{-1}/(4f^2(0))).$$

Proof. On account of Lemma 2, this theorem can be proved by largely the same method employed in proving Theorem 1. So the details are omitted.


3.2 Tobit-Type Estimate

In this subsection, in addition to the cells in $J_{n1}^*$, use will be made of cells belonging to $J_{n2}^*$ in order to form a Tobit-type estimator for $\alpha$. It is believed that by so doing we are able to make some improvements on $\hat\alpha_n^{(r+1)}$ discussed earlier. As Nawata declared in [5], his simulation results in some cases seem to give support to this belief. Theoretically, the problem is complicated, as the probable improvements are likely to depend on actual situations (underlying distributions, sample sizes, method of decomposition of the range of independent variables, etc.) and would be difficult to justify in a reasonably general setting.

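Now use $J_{n1}, \ldots, J_{nd_n}$ to denote the cells belonging to $J_{n2}^*$. The center of $J_{ni}$ will be denoted by $\tilde X_{ni}$, $i = 1, \ldots, d_n$. Put $m_i = \#(J_{ni})$ (the number of elements in $J_{ni} \cap \{X_1, \ldots, X_n\}$), and

$$L(\alpha, \sigma) = \prod_{i=1}^{d_n} \Phi(-\sqrt{m_i}\,\tilde X_{ni}'\alpha/\sigma) \prod_{i=1}^{c_n} \sigma^{-1}\exp[-n_i(Y_{ni}^{(r+1)} - \bar X_{ni}'\alpha)^2/2\sigma^2] \tag{3.7}$$

where $\Phi$ is the distribution function of $N(0,1)$.

If $(\alpha_n^*, \sigma_n^*)$ maximizes $L(\alpha, \sigma)$, we use $\alpha_n^*$ as an estimate of $\alpha$. This kind of estimate was first considered by Tobin [9].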
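
For readers who want to evaluate the criterion (3.7) numerically, the following sketch computes $\log L(\alpha, \sigma)$ for $d = 1$, with $\alpha = (a_0, b)$ so that $x'\alpha = a_0 + bx$. The function name `log_likelihood` and the input layout (lists of per-cell tuples) are assumptions made for this illustration; the maximization over $(\alpha, \sigma)$ is left to whatever optimizer the reader prefers.

```python
import math


def log_likelihood(alpha, sigma, uncensored, censored):
    """Logarithm of the Tobit-type criterion (3.7) for d = 1.

    uncensored: list of (n_i, xbar_i, y_i), where y_i is the adjusted cell median
    censored:   list of (m_i, xcenter_i) for the cells treated as truncated
    """
    a0, b = alpha
    total = 0.0
    for m_i, x in censored:
        z = -math.sqrt(m_i) * (a0 + b * x) / sigma
        # Phi(z) = erfc(-z / sqrt(2)) / 2; math.log raises ValueError
        # if this underflows to zero for extremely negative z
        total += math.log(0.5 * math.erfc(-z / math.sqrt(2.0)))
    for n_i, xbar, y in uncensored:
        resid = y - a0 - b * xbar
        total += -math.log(sigma) - n_i * resid ** 2 / (2.0 * sigma ** 2)
    return total
```

The weights $\sqrt{m_i}$ and $n_i$ reflect the fact that a cell median based on $n_i$ observations has variance of order $1/n_i$.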
We shall prove the following theorem.

THEOREM 3. Suppose that in addition to the conditions of Theorem 2, we have

$$E|X|^{2+\delta} < \infty \quad \text{for some } \delta > 0.$$

Choose $\varepsilon_1 < \delta/(4 + 2\delta)$ (see the beginning of Section 2.2), and $\varepsilon_2'$ in (3.1), such that

$$\varepsilon_2' > 1 - \delta/(4 + 2\delta). \tag{3.8}$$

Then, as $n \to \infty$, we have

$$\sqrt{n}(\alpha_n^* - \alpha) \xrightarrow{L} N(0, V^{-1}/(4f^2(0))) \tag{3.9}$$

where $V$ is defined in (3.6).

This theorem indicates that in the asymptotic sense the Tobit-type estimator $\alpha_n^*$ makes no improvement over $\hat\alpha_n^{(r+1)}$, which is the ordinary LS estimator based upon only the cells in $J_{n1}^*$. Needless to say, in practical applications the sample size $n$ may not necessarily be large. In such cases the question remains as to which one is superior to the other.

In defining $\alpha_n^*$ we make no use of those cells which do not belong to $J_{n1}^* \cup J_{n2}^*$. From a practical point of view this poses no serious problem, as we can always choose $c_0$, $\varepsilon_2$, $c'$, $\varepsilon_1'$ small enough to allow the inclusion of more cells. Theoretically speaking, as long as $P(X'\alpha = 0) = 0$ (which is the case when $X$ is non-atomic), the proportion of sample points not used in the definition of $\alpha_n^*$ goes to zero as $n \to \infty$. Nevertheless, it is interesting to ask whether or not it is possible to invent a trick which enables us to use all sample points in the definition of $\alpha_n^*$, while allowing the number of cells to go to infinity and retaining the basic asymptotic property of $\alpha_n^*$ as described in Theorem 3.

The proof of Theorem 3 will be preceded by several lemmas.

LEMMA 3. Suppose that $\zeta_1, \zeta_2, \ldots$ is a sequence of iid. random variables, and $E|\zeta_1|^a < \infty$ for some $a > 0$. Then

$$\lim_{n\to\infty} n^{-1/a}\max(|\zeta_1|, \ldots, |\zeta_n|) = 0, \quad \text{a.s.} \tag{3.10}$$

Proof is simple.

LEMMA 4. Denote the residual sum of squares by

$$R_n = \sum_{i=1}^{c_n} n_i(Y_{ni}^{(r+1)} - \bar X_{ni}'\hat\alpha_n^{(r+1)})^2. \tag{3.11}$$

Then,  under  the  conditions  of  Theorem  2,  we  have  wpl 


where 


$\sigma_0^2 = (4f^2(0))^{-1}$, and $P^* = P^*(X_1, X_2, \ldots)$ is the conditional probability measure given $X_1, X_2, \ldots$.


(3.12) 


(3.13) 


(3.14) 


Proof. We proceed to show that wp1 there exists a random variable $\eta_n \sim \chi^2_{c_n-d-1}$ such that

$$R_n/\sigma_0^2 - \eta_n = O_p(\sqrt{c_n}\,n^{-\varepsilon_1}). \tag{3.15}$$

From this, (3.12) follows at once.

In order to prove (3.15), we rewrite $R_n$ as

$$R_n = Y_{(n)}^{(r+1)\prime}(W_n - W_nX_{(n)}P_n^{-1}X_{(n)}'W_n)Y_{(n)}^{(r+1)}. \tag{3.16}$$

Notations involved are defined in Section 2.2. Put

$$Z_{ni} = \bar X_{ni}'\alpha + e_{ni}, \quad i = 1, \ldots, c_n; \quad Z_n = (Z_{n1}, \ldots, Z_{nc_n})'.$$

It is not difficult to see by definition (2.10) and Theorem 2 that

$$Y_{(n)}^{(r+1)} = Z_n + \zeta_n, \quad \zeta_n = (\zeta_{n1}, \ldots, \zeta_{nc_n})' \tag{3.17}$$

where $\zeta_{n1}, \ldots, \zeta_{nc_n}$ are random variables uniformly (in $i$) of the order $O_p(n^{-(1/2+\varepsilon_1)})$ as $n \to \infty$, which means that for arbitrarily given $\varepsilon > 0$, a constant $M_\varepsilon$ exists so that for $n$ large

$$P(|\zeta_{ni}| \le M_\varepsilon n^{-(1/2+\varepsilon_1)},\ i = 1, \ldots, c_n) \ge 1 - \varepsilon. \tag{3.18}$$

Put

$$\tilde R_n = Z_n'(W_n - W_nX_{(n)}P_n^{-1}X_{(n)}'W_n)Z_n. \tag{3.19}$$

Then by exactly the same way as in Theorem 1 of [1], we can show that wp1 there exists $\eta_n \sim \chi^2_{c_n-d-1}$ such that

$$\tilde R_n/\sigma_0^2 - \eta_n \to 0. \tag{3.20}$$

This is true because the strong approximation of $e_{ni}$ in [1] is valid for $e_{ni}$ in this paper also, as we indicated in Lemma 1. Now

$$|R_n - \tilde R_n| \le \zeta_n'W_n\zeta_n + 2|e_{(n)}'(W_n - W_nX_{(n)}P_n^{-1}X_{(n)}'W_n)\zeta_n|. \tag{3.21}$$

From (3.18) we have

$$\zeta_n'W_n\zeta_n = O_p(n^{-2\varepsilon_1}). \tag{3.22}$$

By the Schwarz inequality, writing $Q_n = W_n - W_nX_{(n)}P_n^{-1}X_{(n)}'W_n$, we get

$$(e_{(n)}'Q_n\zeta_n)^2 \le e_{(n)}'Q_ne_{(n)}\cdot\zeta_n'Q_n\zeta_n \le e_{(n)}'Q_ne_{(n)}\cdot\zeta_n'W_n\zeta_n = \tilde R_n\cdot\zeta_n'W_n\zeta_n.$$

From this and (3.20), (3.22), we have

$$(e_{(n)}'Q_n\zeta_n)^2 = O_p(c_n n^{-2\varepsilon_1}). \tag{3.23}$$

Now (3.15) follows from (3.20)-(3.23) and Lemma 4 is proved.


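LEMMA 5. Under the conditions of Theorem 3, the sequence $\{\sigma_n^*\}$ is bounded in probability.

Proof. First we make an estimate on $L(\hat\alpha_n^{(r+1)}, \hat\sigma_n)$. For this purpose, note that by Lemma 2, wp1 we have

$$\tilde X_{ni}'\alpha \le -m_i^{-1+\varepsilon_2'}, \quad i = 1, \ldots, d_n \tag{3.24}$$

for $n$ large. By Theorem 2, $\alpha - \hat\alpha_n^{(r+1)} = O_p(n^{-1/2})$, and by Lemma 3 (considering that $E|X|^{2+\delta} < \infty$) for arbitrarily given $\varepsilon > 0$, we have for $n$ large

$$P(|\tilde X_{ni}'(\alpha - \hat\alpha_n^{(r+1)})| \le n^{-\delta/(4+2\delta)},\ i = 1, \ldots, d_n) \ge 1 - \varepsilon. \tag{3.25}$$

By the choice of $\varepsilon_2'$, $-1 + \varepsilon_2' > -\delta/(4+2\delta)$. Hence from (3.24) and (3.25), we have for $n$ large

$$P(\tilde X_{ni}'\hat\alpha_n^{(r+1)} \le -\tfrac12 m_i^{-1+\varepsilon_2'},\ i = 1, \ldots, d_n) \ge 1 - \varepsilon. \tag{3.26}$$

Combining this with (3.12), we have for $n$ large

$$P(\tilde X_{ni}'\hat\alpha_n^{(r+1)} \le -\tfrac12 m_i^{-1+\varepsilon_2'},\ i = 1, \ldots, d_n;\ \hat\sigma_n \le 2\sigma_0) \ge 1 - \varepsilon. \tag{3.27}$$

Since $m_i \ge c_0 n^{\varepsilon_2}$ (see (2.7)), and $\varepsilon_2' > 1 - \delta/(4+2\delta) > 1/2$, we have

$$a \triangleq 1 - 2(1 - \varepsilon_2') > 0$$

and

$$m_i(m_i^{-1+\varepsilon_2'})^2 = m_i^a \ge c_0^a n^{\varepsilon_2 a}, \quad i = 1, \ldots, d_n.$$

Since $\Phi(t) \ge 1 - (\sqrt{2\pi}\,t)^{-1}\exp(-t^2/2)$ for $t > 0$, and $\log(1 - x) \ge -2x$ for $x > 0$ sufficiently small, we see that, in case the event appearing in the left hand side of (3.27) occurs, we have

$$0 \ge \log\prod_{i=1}^{d_n}\Phi(-\sqrt{m_i}\,\tilde X_{ni}'\hat\alpha_n^{(r+1)}/\hat\sigma_n) \ge -\frac{8\sigma_0 d_n}{\sqrt{2\pi}\,c_0^{a/2}n^{\varepsilon_2 a/2}}\exp(-c_0^a n^{\varepsilon_2 a}/32\sigma_0^2) \ge -n^{-k} \to 0, \text{ as } n \to \infty, \text{ for any } k > 0. \tag{3.28}$$

Therefore, for arbitrarily given $\varepsilon > 0$, when $n$ is sufficiently large, we have

$$P(L(\hat\alpha_n^{(r+1)}, \hat\sigma_n) \ge \tfrac12\hat\sigma_n^{-c_n}e^{-c_n/2}) \ge 1 - \varepsilon. \tag{3.29}$$

But if $\sigma > 2\sqrt{e}\,\hat\sigma_n$, we shall have

$$L(\alpha, \sigma) \le \sigma^{-c_n} < \tfrac12\hat\sigma_n^{-c_n}e^{-c_n/2}$$

for any $\alpha$ and $n$ large. From this fact and (3.29), we see that

$$P(\sigma_n^* \le 2\sqrt{e}\,\hat\sigma_n) \ge 1 - \varepsilon \tag{3.30}$$

for $n$ large, and this concludes the proof of the lemma.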


Now  we  can  prove  Theorem  3.  Given  e  >  0,  for  any  aQ  with  ||ag-a^ 
>_  e/«G,  we  have 

*  1  WyN)  vi  \2 

»  T  Z  (».<  -  xnjagl 


(r+1 ) 


109  L(o0.o;)  <  -"logo;  -  — T  l  (y; 

2a*  i»l 


•ologc;  -R„/2o;2  - 


Me  recall  that  Pn  *  n )wnx( n) •  Since  Pn/n  -*•  A  *  C0V(X|X*a>0)  >  0,  we  get 
wpl  for  n  large 

log  l(a0.o*)  i  -n  logo*  -  Rn/2o*?  -  Xe2/ 2  (3.31) 

simultaneously  for  all  aQ  such  that  ||ag-  >  z/J n,  where  \  >  0  is 

the  smallest  eigenvalue  of  a. 
On the other hand, (3.28) still holds true when $\sigma_n$ is replaced by any $\sigma' > 0$. The convergence to zero would be uniform for $\sigma' \le 2\sigma_0$, in case that the event appearing in the left hand side occurs. Therefore, in cases that the events appearing in the left hand side of (3.26) and (3.30) both occur, we shall have

$$\log L(\hat\alpha_n^{(r+1)}, \sigma_n^*) \ge -c_n \log \sigma_n^* - R_n/2\sigma_n^{*2} - \varepsilon_n, \tag{3.32}$$

where $\lim_{n\to\infty} \varepsilon_n = 0$. From (3.31) and (3.32), we get

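$$P\bigl(\sup\{L(\alpha_0, \sigma_n^*) : \|\alpha_0 - \hat\alpha_n^{(r+1)}\| \ge \varepsilon/\sqrt{n}\} < L(\hat\alpha_n^{(r+1)}, \sigma_n^*)\bigr) > 1 - 2\varepsilon$$

for n large. This implies that

$$P\bigl(\|\alpha_n^* - \hat\alpha_n^{(r+1)}\| \ge \varepsilon/\sqrt{n}\bigr) < 2\varepsilon$$

for n large. Therefore

$$\sqrt{n}\,(\alpha_n^* - \hat\alpha_n^{(r+1)}) \xrightarrow{P} 0 \quad (n \to \infty). \tag{3.33}$$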

Now  (3.9)  follows  from  Theorem  2  and  (3.33).  This  concludes  the  proof  of 
Theorem  3. 

3.3  Estimation of $\sigma_0^2$

Under the method of estimation of the present paper, from a large-sample point of view, $\sigma_0^2$ defined in (3.13) plays the role of error variance. Similar to the case of $\alpha$, we can define two estimates of $\sigma_0^2$. One is $\sigma_n^2$, which uses only the selected cells and is the common estimate of error variance based on the residual sum of squares. Another is $\sigma_n^{*2}$, which is a kind of maximum likelihood estimate in the Tobit model. The following lemma reveals that these two are asymptotically equivalent.
LEMMA 6.  Under the condition of Theorem 3, we have

$$n^a(\sigma_n^* - \sigma_n) \to 0, \quad \text{a.s.} \quad (n \to \infty) \tag{3.34}$$

for any constant $a > 0$.


Proof.  By  Lemma  5,  (3.28),  we  have  wpl 


log  ^•(-.firj'X^jaQ/oJ)  ~  l09  ^♦(-^1S<r+1>/on)  <  fl‘k  (3.35) 


for  n  large,  where  k  is  arbitrarily  given.  Further 


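$$T_n = \log \prod_{i=1}^{c_n} \sigma_n^{*-1} \exp\bigl[-n_i(\bar Y_{ni}^{(r+1)} - \bar X_{ni}'\alpha_n^*)^2/2\sigma_n^{*2}\bigr] - \log \prod_{i=1}^{c_n} \sigma_n^{-1} \exp\bigl[-n_i(\bar Y_{ni}^{(r+1)} - \bar X_{ni}'\hat\alpha_n^{(r+1)})^2/2\sigma_n^{2}\bigr]$$

$$= -\frac{1}{2\sigma_n^{*2}} \sum_{i=1}^{c_n} n_i(\bar Y_{ni}^{(r+1)} - \bar X_{ni}'\alpha_n^*)^2 + \frac{1}{2\sigma_n^{2}} \sum_{i=1}^{c_n} n_i(\bar Y_{ni}^{(r+1)} - \bar X_{ni}'\hat\alpha_n^{(r+1)})^2 - c_n \log \sigma_n^* + c_n \log \sigma_n.$$

Since

$$\sum_{i=1}^{c_n} n_i(\bar Y_{ni}^{(r+1)} - \bar X_{ni}'\alpha_n^*)^2 \ge \sum_{i=1}^{c_n} n_i(\bar Y_{ni}^{(r+1)} - \bar X_{ni}'\hat\alpha_n^{(r+1)})^2 = R_n,$$

we have

$$T_n \le R_n(\sigma_n^{*2} - \sigma_n^2)/(2\sigma_n^2\sigma_n^{*2}) - c_n \log(\sigma_n^*/\sigma_n) = c_n[(1 - x^2)/2 + \log x] \le -c_n|x - 1|^2/2, \tag{3.36}$$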


where $x = \sigma_n/\sigma_n^*$. Hence, if $|\sigma_n/\sigma_n^* - 1| \ge \varepsilon n^{-a}$, then, by (3.35) and (3.36), we shall have, on taking $k = 2a + 1$ in (3.35), that

$$\log L(\alpha_n^*, \sigma_n^*) - \log L(\hat\alpha_n^{(r+1)}, \sigma_n) < 0 \tag{3.37}$$

for n large. But (3.37) is impossible as $(\alpha_n^*, \sigma_n^*)$ maximize $L(\alpha, \sigma)$. This shows that wpl we have

$$|n^a(\sigma_n^* - \sigma_n)| < \varepsilon$$

for n large, and (3.34) is proved.
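
The last step of (3.36) rests on the elementary bound $(1 - x^2)/2 + \log x \le -(x - 1)^2/2$ for $x > 0$, which follows from $\log x \le x - 1$. The following is a small numerical sanity check of that bound, not part of the proof; the grid of ratios $x$ and the function names are illustrative assumptions.

```python
import math


def log_likelihood_gap(x: float) -> float:
    """Per-cell middle term of (3.36): (1 - x^2)/2 + log x, for x > 0."""
    return (1.0 - x * x) / 2.0 + math.log(x)


def quadratic_bound(x: float) -> float:
    """Per-cell right-hand side of (3.36): -(x - 1)^2 / 2."""
    return -((x - 1.0) ** 2) / 2.0


# Hypothetical grid of ratios x (illustrative values only): 0.05, 0.10, ..., 4.00.
grid = [0.05 * k for k in range(1, 81)]

# Collect any grid point where the claimed inequality would fail
# (a tiny tolerance absorbs floating-point rounding near x = 1).
violations = [x for x in grid if log_likelihood_gap(x) > quadratic_bound(x) + 1e-12]
print("violations:", violations)
```

The gap between the two sides equals $\log x - (x - 1)$, so equality holds only at $x = 1$.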

THEOREM  4.  Under  the  conditions  of  Theorem  3: 

1°.  If X is purely atomic with c distinct atoms, $d < c < \infty$, then as $n \to \infty$


°V°l  - 

(3.38) 

2°.  In other cases we have as $n \to \infty$

(H.0) 

(3.39) 

(N.o) 

(3.40) 

^vl/oo  *  N(0»1) 

(3.41) 

-  r4(cn  -  d)  -L  N(0,  1 ). 

(3.42) 

Proof.  In case 1° we have wpl $c_n = c$ for n large. By (3.15), wpl, under $P^*$ the first convergence in (3.38) holds. Hence this is also true unconditionally. This proves the first assertion of (3.38). The second follows from the first and Lemma 6.

In case 2° we have $c_n \to \infty$, a.s. From (3.15) and the central limit theorem, wpl, under $P^*$ we have (3.39). So (3.39) is still true unconditionally. (3.40) follows from (3.39) and Lemma 6.
(3.41) follows from (3.15), and the following two facts:

a)  if $\xi_n \sim \chi_n^2$, then $\sqrt{2\xi_n} - \sqrt{2n} \xrightarrow{L} N(0, 1)$, as $n \to \infty$,

b)  $\sqrt{x + a(x)} - \sqrt{x} \to 0$, as $x \to \infty$, whenever $\lim_{x\to\infty} a(x)/\sqrt{x} = 0$.

(3.42) follows from (3.34) and (3.41).
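
Fact a) is the classical square-root normal approximation to the chi-square distribution. As an illustration only, the sketch below simulates $\sqrt{2\xi_n} - \sqrt{2n}$ and reports its sample mean and standard deviation, which should be close to 0 and 1 for large n. The degrees of freedom, replication count and seed are arbitrary assumptions; the chi-square draws use the identity $\chi_n^2 = \mathrm{Gamma}(n/2, \text{scale } 2)$.

```python
import math
import random
import statistics
from typing import List


def fisher_statistic(xi: float, dof: int) -> float:
    """sqrt(2 * xi) - sqrt(2 * dof), where xi is a chi-square draw with dof degrees of freedom."""
    return math.sqrt(2.0 * xi) - math.sqrt(2.0 * dof)


def simulate(dof: int, reps: int, seed: int = 0) -> List[float]:
    """Draw `reps` values of the statistic (dof, reps and seed are illustrative choices)."""
    rng = random.Random(seed)
    # A chi-square variable with `dof` degrees of freedom is Gamma(shape=dof/2, scale=2).
    return [fisher_statistic(rng.gammavariate(dof / 2.0, 2.0), dof) for _ in range(reps)]


draws = simulate(dof=400, reps=20000)
print("mean:", statistics.fmean(draws))
print("std :", statistics.pstdev(draws))
```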




4.  TESTING  OF  LINEARITY 


In  practical  applications  we  are  often  not  sure  that  the  regression 
function  (the  conditional  median  of  Y  given  X)  is  linear,  and  a  test  for 
this  hypothesis  is  desirable.  In  this  section  we  shall  propose  such  a  test. 

The idea behind the test is quite simple and is similar to the one proposed in [1], where the regression function is defined as $E(Y \mid X = x)$ and no truncation is allowed. From now on we use $H_0$ to denote the linear hypothesis (2.3).

If (2.3) is not true, then the residual sum of squares $R_n$, defined by (3.11), tends to become larger. Therefore a reasonable test of $H_0$ is to reject it when

$$R_n > C \tag{4.1}$$

for some C, and accept it otherwise. C is chosen according to the pre-assigned size $\alpha_0$. In order to do this, we have to find an estimate $\hat\sigma_n^2$ of $\sigma_0^2 = 1/(4f^2(0))$ such that (3.15) still holds true when $\sigma_0^2$ is replaced by $\hat\sigma_n^2$, under $H_0$. For if such an estimate $\hat\sigma_n^2$ has been found, then (3.41) remains valid when $\sigma_0^2$ is replaced by $\hat\sigma_n^2$ (under $H_0$), and we can choose

$$C = \hat\sigma_n^2\bigl(\sqrt{2(c_n - d)} + u_{\alpha_0}\bigr)^2/2, \tag{4.2}$$

where $u_{\alpha_0}$ is defined by $\Phi(u_{\alpha_0}) = 1 - \alpha_0$. The test (4.1) is asymptotically similar with size $\alpha_0$.
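
As a minimal sketch of how the rejection rule could be computed, assume the critical value takes the form $C = \hat\sigma^2(\sqrt{2(c_n - d)} + u)^2/2$ with $u$ the upper $\alpha_0$ point of the standard normal distribution, which is how we read (4.2). The function names and the numbers in the usage line are hypothetical; the residual sum of squares and the variance estimate are taken as given inputs.

```python
import math
from statistics import NormalDist


def critical_value(sigma2_hat: float, c_n: int, d: int, size: float) -> float:
    """C = sigma2_hat * (sqrt(2 (c_n - d)) + u)^2 / 2, where Phi(u) = 1 - size."""
    if c_n <= d:
        raise ValueError("the number of cells c_n must exceed the dimension d")
    if not 0.0 < size < 1.0:
        raise ValueError("size must lie strictly between 0 and 1")
    u = NormalDist().inv_cdf(1.0 - size)  # upper `size` quantile of N(0, 1)
    return sigma2_hat * (math.sqrt(2.0 * (c_n - d)) + u) ** 2 / 2.0


def reject_linearity(r_n: float, sigma2_hat: float, c_n: int, d: int, size: float = 0.05) -> bool:
    """Reject the linear hypothesis when the residual sum of squares exceeds C."""
    return r_n > critical_value(sigma2_hat, c_n, d, size)


# Hypothetical inputs: 52 cells, 2 regression coefficients, variance estimate 1.0.
print(critical_value(sigma2_hat=1.0, c_n=52, d=2, size=0.05))
```

The rule is equivalent to comparing $\sqrt{2R_n}/\hat\sigma - \sqrt{2(c_n - d)}$ with $u$, which is the normal approximation used for the limiting distribution.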

The problem of estimating $\sigma_0^2$ is reduced to the problem of estimating $f(0)$, the value of the density function of $e_1$ at zero.

It is easy to see that if an estimate $\hat\sigma_n^2$ of $\sigma_0^2$ satisfies

$$\sqrt{c_n}\,(\hat\sigma_n^2 - \sigma_0^2) \xrightarrow{P} 0, \tag{4.3}$$

then $\hat\sigma_n^2$ will have the property required in Section 4.1. It is obvious


that if we can find an estimate $f_n(0)$ of $f(0)$ such that

$$\sqrt{c_n}\,(f_n(0) - f(0)) \xrightarrow{P} 0, \tag{4.4}$$

then $\hat\sigma_n^2 = (4f_n^2(0))^{-1}$ satisfies (4.3).

Choose  ej  e  (0,  l/3d)  in  the  definition  of  J*  in  Section  2.2.  Since 
£-|  <  1/3,  we  have  e-j  >  2/3  in  the  definition  of  J*^.  Take  >  1  -  e-j  in 
(3.1),  then  1  -  e’  <  1/3. 

Choose $\varepsilon_0 \in (0, (\varepsilon_2 - 2/3)/4)$ (see (2.5)) and $c_0 > 0$. Select out such cells I in $J_n^*$ satisfying the condition

$$x \in I \implies |x| \le c_0 n^{\varepsilon_0}. \tag{4.5}$$

For convenience we shall denote all these cells by $J_{n1}, \dots, J_{nc_n'}$.


Define

$$I_{ni} = \{ j : |Y_{ni}(j) - X_{ni}'(j)\hat\alpha_n^{(r+1)}| \le n^{-1/3},\ j = 1, \dots, n_i \}, \quad i = 1, \dots, c_n'. \tag{4.6}$$

Since


and  from  Theorem  2  we  have 


$$\hat\alpha_n^{(r+1)} = O_p(1), \qquad |\hat\alpha_n^{(r+1)} - \alpha| = O_p(n^{-1/2}).$$


Also.  IX^UJ-X^I  <  n  |X^|  <  c„n  0  for  1  =  1 . c^,  and  by 


-’S 


Lemma  2,  wpl  X^a  >  n  ,  i  s  1,  ...»  cn  for  n  large.  We  see  from  (4.6) 


$$\lim_{n\to\infty} P(E_n) = 1, \tag{4.7}$$

where $E_n$ is the event

$$E_n = \{ j \in I_{ni} \text{ for some } i = 1, \dots, c_n' \implies Y_{ni}(j) > 0 \}. \tag{4.8}$$

When $E_n$ occurs, the number of elements $g_{ni}$ in $I_{ni}$ can be calculated from the truncated observations of the dependent variable Y, and the quantity

$$g_n = \sum_{i=1}^{c_n'} g_{ni}$$

is well defined in $E_n$ (it can be calculated from the truncated samples when $E_n$ occurs).


Since 


$$|(Y_{ni}(j) - X_{ni}'(j)\hat\alpha_n^{(r+1)}) - e_{ni}(j)| \le |X_{ni}(j)|\,|\hat\alpha_n^{(r+1)} - \alpha|\,d,$$


there  exists  constant  A  such  that 


lim  P(E  )  *  1 


(4.9) 


where 


j  *  1.  ...»ni,  i  *1 . cn>. 


Now  define  an  estimate  of  f (0)  as  follows: 


-1/2+e, 


(4.10) 


$$f_n(0) = \begin{cases} g_n/(2n^{-1/3}N_n'), & \text{when } E_n \text{ occurs}, \\ 0, & \text{otherwise}, \end{cases} \qquad N_n' = n_1 + \dots + n_{c_n'}, \tag{4.11}$$


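and proceed to show that this estimate satisfies (4.4). For this purpose, let

$g_n(a, b)$ = the number of elements in the set

$$\{ (i, j) : a \le e_{ni}(j) \le b,\ j = 1, \dots, n_i,\ i = 1, \dots, c_n' \}$$

and define

$$f_{n1}(0) = g_n(-n^{-1/3},\ n^{-1/3})/(2n^{-1/3}N_n'),$$

$$f_{n2}(0) = g_n(-n^{-1/3} + n^{2\varepsilon_0 - 1/2},\ n^{-1/3} - n^{2\varepsilon_0 - 1/2})/(2n^{-1/3}N_n'),$$

$$f_{n3}(0) = g_n(-n^{-1/3} - n^{2\varepsilon_0 - 1/2},\ n^{-1/3} + n^{2\varepsilon_0 - 1/2})/(2n^{-1/3}N_n').$$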
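
The estimates above are window counts: the number of residuals within $n^{-1/3}$ of zero, divided by the window length $2n^{-1/3}$ times the number of pooled observations. The sketch below illustrates that computation under simplifying assumptions: the residuals from the selected cells are passed in as one flat list, truncation is ignored, and the variance estimate is taken to be $1/(4f_n^2(0))$. All function names and the simulated data are hypothetical.

```python
import random
from typing import Sequence


def density_at_zero(residuals: Sequence[float], n: int) -> float:
    """Window estimate of f(0): (#residuals with |r| <= h) / (2 h N), with h = n^(-1/3)."""
    h = n ** (-1.0 / 3.0)      # half-width of the window around zero
    pooled = len(residuals)    # plays the role of the pooled count over selected cells
    if pooled == 0:
        return 0.0
    count = sum(1 for r in residuals if abs(r) <= h)
    return count / (2.0 * h * pooled)


def sigma0_sq_estimate(f0: float) -> float:
    """Plug-in variance estimate 1 / (4 f0^2); requires f0 > 0."""
    if f0 <= 0.0:
        raise ValueError("the density estimate must be positive")
    return 1.0 / (4.0 * f0 * f0)


# Illustrative data: standard normal errors, for which f(0) = 1/sqrt(2 pi), about 0.399.
rng = random.Random(1)
n = 50000
errors = [rng.gauss(0.0, 1.0) for _ in range(n)]
f0 = density_at_zero(errors, n)
print("f_n(0):", f0, " sigma^2 estimate:", sigma0_sq_estimate(f0))
```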

From the well-known result in the theory of density estimation (see [8], Chapter 2) and the easy fact that

$$\liminf_{n\to\infty} N_n'/n > 0, \quad \text{a.s.},$$

under the assumption of Section 2.1, we have

$$f_{n1}(0) - f(0) = O_p(n^{-1/3}). \tag{4.12}$$

Since

$$f_{n2}(0) = (1 + O(n^{2\varepsilon_0 - 1/6}))f_{n1}(0), \qquad f_{n3}(0) = (1 + O(n^{2\varepsilon_0 - 1/6}))f_{n1}(0), \tag{4.13}$$

from (4.12) we have

$$f_{n2}(0) - f(0) = O_p(n^{2\varepsilon_0 - 1/6}), \qquad f_{n3}(0) - f(0) = O_p(n^{2\varepsilon_0 - 1/6}). \tag{4.14}$$

On the other hand, it is easy to see that when n is large and the events appearing in (4.7) and (4.9) both occur, we have

$$f_{n2}(0) \le f_n(0) \le f_{n3}(0).$$
Therefore, from (4.7), (4.9) and (4.14), we get

$$f_n(0) - f(0) = O_p(n^{2\varepsilon_0 - 1/6}). \tag{4.15}$$

But $\sqrt{c_n} = O_p(n^{1/2 - \varepsilon_2/2})$ (see (2.6)), and since $\varepsilon_0 < (\varepsilon_2 - 2/3)/4$, we have $1/6 - 2\varepsilon_0 > 1/2 - \varepsilon_2/2$. From this and (4.15), we finally get (4.4).
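
The closing step is pure exponent bookkeeping: if $\sqrt{c_n}$ grows no faster than $n^{1/2 - \varepsilon_2/2}$ and the estimation error is of order $n^{2\varepsilon_0 - 1/6}$, their product has exponent $(1/2 - \varepsilon_2/2) + (2\varepsilon_0 - 1/6)$, which is negative exactly when $\varepsilon_0 < (\varepsilon_2 - 2/3)/4$. The sketch below checks this with exact rational arithmetic. The two rates are taken as assumptions read from the text above, and the sample values of $\varepsilon_0$ and $\varepsilon_2$ are hypothetical.

```python
from fractions import Fraction


def product_exponent(eps0: Fraction, eps2: Fraction) -> Fraction:
    """Exponent of n in sqrt(c_n) * (f_n(0) - f(0)) under the two assumed rates."""
    growth = Fraction(1, 2) - eps2 / 2   # sqrt(c_n) = O_p(n^growth)
    decay = 2 * eps0 - Fraction(1, 6)    # f_n(0) - f(0) = O_p(n^decay)
    return growth + decay


def eps0_upper_bound(eps2: Fraction) -> Fraction:
    """Upper end of the admissible interval for eps0: (eps2 - 2/3) / 4."""
    return (eps2 - Fraction(2, 3)) / 4


eps2 = Fraction(3, 4)    # hypothetical choice with eps2 > 2/3
eps0 = Fraction(1, 60)   # hypothetical choice below the bound 1/48
print(eps0_upper_bound(eps2), product_exponent(eps0, eps2))
```

At the boundary value of $\varepsilon_0$ the exponent is exactly zero, so the strict inequality in the choice of $\varepsilon_0$ is what makes the product vanish.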